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Abstract 

Results are presented from a search for the pair-production of heavy quarks, QQ, that 
decay exclusively into a top quark and a W or Z boson. The search is performed using 
a sample of proton-proton collisions at y/s = 7 TeV corresponding to an integrated lu- 
minosity of 5.0 fb -1 , collected by the Compact Muon Solenoid experiment. The signal 
region is defined using a sample of events containing one electron or muon, missing 
transverse momentum, and at least four jets with large transverse momenta, where 
one jet is likely to originate from the decay of a bottom quark. No significant excess 
of events is observed with respect to the standard model expectations. Assuming a 
strong pair-production mechanism, quark masses below 675 (625) GeV decaying into 
tW (tZ) are excluded at the 95% confidence level. 
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1 Introduction 

New heavy quarks (Q) that decay to top quarks and electroweak bosons (W, Z, or Higgs 
bosons) appear in many theoretical scenarios, the simplest being the sequential fourth genera- 
tion model m. Experimental constraints on masses of the fourth-generation quarks |2]-|4l sug- 
gest that the dominant decay mode of the down-type fourth-generation quark is to a top quark 
and a W boson. There are also models (5HZI that include non-chiral heavy quarks with vector- 
like couplings to bosons. Such quarks cancel quadratically divergent corrections to the Higgs 
boson mass, and thereby stabilize it at the electroweak symmetry breaking scale. In these mod- 
els, the new quarks can also decay through flavor-changing neutral-current processes, such as 
to a top quark and a Z boson, or to a top quark and a Higgs boson. 

This Letter presents a search for the pair production of new heavy quarks Q that decay exclu- 
sively to a top quark and a W boson, or to a top quark and a Z boson. The current experimental 
constraints on masses of heavy quarks Q are set by previous LHC searches. If Q exists, its 
mass must be greater than 480 GeV, assuming the exclusive decay Q — > tW |8|, or greater than 
475 GeV, assuming the decay Q — > tZ [9|. Given these constraints, the mass splitting between 
the heavy quark Q and the top quark must be large, so that the decay products can be pro- 
duced on-shell. For heavy down-type quarks decaying exclusively into a top quark and a W 
boson, the full decay chain is QQ — > tW~tW + — >■ bW + W~bW W + , and for up-type quarks 
decaying exclusively into a t quark and a Z boson, it is QQ — > tZtZ — > bW + ZbW Z. The 
search is performed in events in which one of the W bosons (originating either from the decay 
of the heavy quark, or from the subsequent decay of a top quark) decays leptonically while 
the other bosons decay into quark-antiquark pairs. Selected events are required to have exactly 
one charged lepton, an imbalance in transverse momentum (pj ), and at least four jets with 
high transverse momenta (fx), at least one of which is consistent with the decay of a bottom 
quark. 

The dominant standard-model (SM) processes that result in the same signature include tt pro- 
duction, as well as production of W bosons with associated jets. These background processes 
are characterized by smaller lepton and jet transverse momenta and lower jet multiplicities 
than those in heavy quark decays. The search for heavy quarks is performed by classifying 
events based on the number of final-state jets. For each jet multiplicity, the scalar sum (St) of 
the transverse momenta of the lepton, the jets, and tfj are used to test for the presence of a new 
physics signal in the data. 

2 The CMS Detector and Data Samples 

The data were recorded during 2011 by the Compact Muon Solenoid (CMS) experiment at the 
LHC, which delivered proton-proton collisions at yjs = 7 TeV, and correspond to an integrated 
luminosity of 5.0 fb _1 . The CMS detector uses a polar coordinate system with the z axis point- 
ing along the counterclockwise circulating beam. The x axis points towards the center of the 
Large Hadron Collider (LHC) ring, and the y axis points up. Angular coordinates are specified 
by the pseudorapidity rj = — ln[tan(0/2)], where 6 is the polar angle measured with respect to 
the positive z axis, and by (p, the azimuthal angle about this axis. 

A characteristic feature of the CMS detector is its superconducting solenoid, which is 6 m in 
diameter and 13 m in length, and provides an axial magnetic field of 3.8 T. Located inside 
the solenoid is a multilayer silicon-pixel and silicon-strip tracker, covering the pseudorapidity 
region \rj\ < 2.5, the electromagnetic calorimeter (ECAL), covering \t]\ < 3.0 made of lead- 
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3 Event Reconstruction 



tungstate crystals, the preshower detector covering 1.65 < \rj\ < 2.6, and the hadronic calori- 
meter (HCAL) made of brass and scintillators, covering \t]\ < 3.0. Muons are measured with 
gas-ionization detectors embedded in the return yoke of the solenoid, which cover \tj\ < 2.4, 
and complement the measurement in the inner tracking detector. The CMS detector is nearly 
hermetic, allowing for momentum balance measurements in the plane transverse to the beam 
direction. A two-tier trigger system selects the most interesting pp collision events for use in 
physics analysis. A detailed description of the CMS detector is given in Ref. IITOl . 

The e+jets events were collected with triggers requiring at least one electron candidate, with a 
pj threshold ranging from 25 to 32 GeV. When the LHC instantaneous luminosity increased, 
at least three jets with pj > 30 GeV were also required. The /^+jets events were collected with 
triggers that required at least one muon candidate with a pj threshold ranging between 30 
and 40 GeV. The presence of jets was not required in the triggers for the f/+jets events, and no 
conditions were placed on the tfj for either channel. 

The background processes tt+jets, W+jets, and Z+jets, are simulated using the MadGraph 5.1.1 
event generator [11] with CTEQ6L1 [12J parton distribution functions (PDF). The single top 
quark production via tW, s and t channels is simulated using the POWHEG 1.0 event genera- 
tor fl5HI5l with CTEQ6M PDF |J2|. The multijet and diboson processes (WW, WZ, and ZZ) are 
generated using the pythia 6.424 event generator £[61 with CTEQ6M PDF. pythia is also used 
to model the parton shower and hadronization for both MADGRAPH and the POWHEG Monte 
Carlo (MC) samples. The generated events are processed through a CMS detector simulation 
based on Geant4 |T7[. Additional minimum-bias events (pileup) are generated with PYTHIA 
and superimposed on the hard-scattering events to simulate multiple collisions within the same 
bunch crossing. All the MC simulated events are weighted to reproduce the distribution of the 
number of interaction vertices observed in data. 

3 Event Reconstruction 

Events are reconstructed using the CMS particle-flow (PF) algorithm [18-20], which identifies 
all observable particles in an event by combining the information from charged particles in 
the silicon tracker, energy deposited in the ECAL and HCAL, and signals in the preshower 
detector and the muon systems. This procedure separates all particles into five categories: 
muons, electrons, photons, and charged and neutral hadrons. Energy calibration is performed 
separately for each particle type. The missing transverse momentum in an event is defined as 
the negative vector sum of the transverse momenta of all objects from the PF algorithm. Events 
must also have an acceptable primary vertex, and we select the vertex with the largest value 
for the scalar sum of the p\ of the associated tracks. 

Electron candidates are reconstructed from clusters of energy deposited in the ECAL. The clus- 
ters are first matched to track seeds in the pixel detector. The track trajectories of electron 
candidates are reconstructed using a dedicated modeling of the electron energy loss, and fitted 
with a Gaussian-sum filter l|2"Tf . 

Muon candidates are identified through different reconstruction algorithms using hits in the 
central silicon tracker and signals in the outer muon system (22| . A standalone muon algorithm 
uses only information from the muon chambers. The tracker muon algorithm begins with 
tracks found in the inner tracker, and associates these with matching segments in the muon 
chambers. In this analysis, all muons have to pass the global muon algorithm, which starts 
off with the standalone muons and then performs a global fit to the hits in the tracker and the 
muon system for each muon candidate. 
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Jets are reconstructed using the anti-fcx jet clustering algorithm l|23~t with a distance parameter 
R = 0.5, as implemented in FASTJET version 2.4 lj24Tl27| . Jets are identified as originating from 
the decay of a bottom quark through the combined secondary vertex (CSV) algorithm at the 
medium operating point [28|. The CSV algorithm provides optimal b-tagging performance by 
combining information on the impact parameter significance, the properties of the secondary 
vertex, and the jet kinematics. The variables are combined using a likelihood-ratio technique 
to compute a b-tagging discriminant. The residual differences in the performance of the b- 
tagging algorithm between data and simulation are accounted for by pj- and //-dependent 
data /simulation scale factors |28| . 

4 Event Selection 

Charged leptons from the decay of W bosons are typically well isolated from jets. The lep- 
ton isolation can be expressed in terms of the quantity I~, defined as the scalar sum of the 
transverse momenta of charged hadrons, neutral hadrons and photons in a cone of AR = 
\J (A^) 2 + (A?/) 2 < 0.3 around the lepton momentum vector, divided by the lepton pj. The 
isolation requirements are optimized to be If < 0.1 for electrons, and if < 0.125 for muons. 
The electrons (muons) also must have p| > 35GeV (p T > 42GeV), and \t] e \ < 2.5 ((//'' | < 2.1). 
The lepton trajectories are required to have a magnitude of the transverse impact parameter less 
than 0.02 cm and a magnitude of the longitudinal impact parameter along the beam direction 
less than 1 cm relative to the primary vertex. 

The final selection requires events to have exactly one isolated lepton and at least four jets 
with \rj | < 2.4 and pj > 100, 60, 50, 35GeV. Additional jets having p T > 35GeV are also 
counted. The minimum number of jets, and the jet pi requirements are optimized to enhance 
the sensitivity to the QQ signal. The thresholds for lepton p l T and the third jet pj are driven by 
trigger conditions. Jets that are within a cone of AR < 0.3 of the lepton direction are ignored. 
At least one jet must be b-tagged by the CSV algorithm. The event is also required to have 
p'j > 20 GeV. 

Table [TJ lists the number of events observed and the number expected for the background 
sources, following all selections. The cross section for tt production is taken from Ref. [29|. The 
single top quark cross sections are approximate NNLO calculations obtained from Ref. l3"0T(3"2"| . 
The cross sections for W+jets and Z+jets are computed to NNLO using fewz 1133 1 . The cross 
sections for the diboson processes WW, WZ, and ZZ are calculated to NLO using MCFM [34|. 
The expected number of SM background events is evaluated based on the cross sections given 
in Table [TJ the corresponding efficiencies and acceptances for each background, and integrated 
luminosity, with exception of the contributions from multijet processes, which are estimated 
from data. We perform a fit of SM contributions, normalized as described above, to the p'j 
distribution in data. In the fit, we constrain the tf+jets contribution within its measured uncer- 
tainty, while Z+jets, single top and diboson production processes are constrained within their 
theoretical uncertainties. The normalizations for the W+jets and multijet contributions are al- 
lowed to float freely. We obtain multijet scale factors as a function of the lepton rj and use them 
to correct for the multijet normalization from simulation. For ^+jets, the multijet background 
using this technique is found to be negligible. 

Table [2] presents the QQ production cross sections that are computed at approximate NNLO 
using HATHOR 11351 , along with the expected number of events for the e+jets and f/+jets chan- 
nels. 
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4 Event Selection 



Table 1: Background cross sections, expected number of background events and observed num- 
ber of events in 5.0 fb data, for the e+jets and }i+)ets samples prior to the likelihood fit. The 
uncertainties on the expected number of events reflect only the statistics of the simulation. The 
quoted uncertainties on the cross sections are theoretical or measured in the case of tt+jets. 



Background process 


Cross section 


e+jets events 


/^+jets events 


tt+jets 


154 ± 19 pb 


7521 ± 38 


7190 ± 37 


Single top 


84.9 ± 2.5 pb 


399 ±4 


391 ±4 


W+jets 


31 ± 2 nb 


798 ± 16 


790 ± 16 


Z+jets 


3.1 ± 0.3 nb 


104 ±3 


63 ±2 


Diboson (WW, WZ, ZZ) 


67 ± 2 pb 


17 ±1 


15 ±1 


Multijet 


from data 


334 ±8 




Total background 




9173 ± 42 


8449 ± 44 


Data 




9105 


8211 



Table 2: Signal cross sections 113511 and expected number of QQ signal events in the e and ji 
channels for four quark masses. The uncertainties reflect the statistics of the MC simulations. 



M Q (GeV) 


a (pb) 


Q^tW 


Q -> tW 


Q -> tZ 


Q -> tZ 






e+jets 


^+jets 


e+jets 


^+jets 


500 


0.33 


136.9 ± 1.4 


137.0 ± 1.4 


91.2 ±1.2 


85.9 ± 1.1 


550 


0.17 


74.6 ± 0.8 


74.9 ± 0.8 


49.1 ± 0.6 


46.7 ±0.6 


600 


0.092 


41.2 ±0.4 


41.7 ±0.4 


27.1 ± 0.3 


26.1 ± 0.3 


650 


0.051 


22.9 ± 0.2 


23.4 ± 0.2 


15.9 ±0.3 


15.1 ±0.3 
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5 Likelihood Fit and Systematic Uncertainties 

The search for a QQ signal is performed by fitting the data to the distribution of Sj as a function 
of jet multiplicity (Nj). The fit is performed for the combination of e+jets and ^+jets channels 
and for Nj = 4, 5, 6, and > 7 jets. The bins are chosen so that the MC statistical uncertainty in 
each bin is less than 17%. 

The dominant SM background is from tt production. Because the jet multiplicity for the tt+jets 
events is not well modeled, the tt contributions for Nj = 4, 5, 6, and > 7 are allowed to vary 
independently in the fit. A log-normal constraint is imposed on the expected yield of tt events 
for each jet multiplicity with an uncertainty determined from the difference that results from 
changing the renormalization and factorization scales by a factor of two relative to the nominal 

value equal to Q = ■Jrr^ + Pjjet ' wnere the sum is taken over jets produced in association 

with the tf pair. The inclusive top quark pair production cross section cr^ and its uncertainty 
are taken from the recent CMS measurement [29]. The uncertainty on cr fi is used in the fit as a 
log-normal constraint correlated between different jet multiplicities. 

Other SM contributions include electroweak processes: W+jets, Z+jets, single top quark, and 
diboson production, as well as multijet events. An uncertainty of 50% is assigned to the nor- 
malization of the sum of these backgrounds. They are combined into a single background 
template, and and the sum of their contributions is allowed to vary independently across each 
jet multiplicity bin. 

The luminosity is constrained to a log-normal distribution with an uncertainty of 2.2% [36|. The 
electron and muon trigger and identification efficiencies are obtained from data using dilepton 
decays of Z bosons. A conservative systematic uncertainty of 3.5% is attributed to account for 
pileup and lepton tj dependence. These efficiencies together with their uncertainties are treated 
as normalization constraints, and applied to the electron and muon events respectively. 

In addition to constraints on normalization, there are other parameters that affect both the nor- 
malization and the shape of Sj and the jet multiplicity spectra. These include the jet energy 
scale, the b-tagging efficiency, the matching between matrix element partons and parton show- 
ers and the renormalization and factorization scales. We incorporate these uncertainties in the 
fit by generating additional templates corresponding to shifts by ± 1 standard deviation on 
the parameter in question. The energies of the jets are corrected using the calibration constants 
determined in Ref. 11371 as a function of pj and t]. The uncertainties on b-tagging efficiencies 
are estimated by changing the b-tagging efficiency by ±1 of its standard deviation [28|. The 
uncertainty due to the choice of factorization and renormalization scales is estimated by simu- 
lating two sets of tt samples in which both scales are increased or decreased by a factor of two 
relative to their nominal value. The uncertainty arising from matching matrix element partons 
with parton showers is estimated using two tf simulated samples, with matching threshold 
shifted up or down by a factor of two relative to its default value (40GeV). Other sources of 
systematic uncertainties, such as jet energy resolution, flj resolution and pileup interactions 
have negligible impact on the limit for a QQ signal. 

Systematic uncertainties enter the likelihood through "nuisance" parameters [38 1, that reflect 
the presence of imprecisely determined quantities that affect the Sj and jet multiplicity distri- 
butions. These are represented by resolution functions contained within the likelihood func- 
tion, and are integrated over in the process of minimization, resulting in a reduction in the final 
systematic uncertainty. 

Table[3]summarizes the systematic uncertainties included in the fit to Sj and Nj. Parameters la- 
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5 Likelihood Fit and Systematic Uncertainties 



Table 3: List of systematic uncertainties included in the likelihood fit. Parameters labeled 
"Distribution" affect both shape and normalization of the Sj and Nj distributions. The quoted 
uncertainties correspond to their effect on normalization only. 



Parameter type Source Uncertainty (%) 

Q 2 scales for tt+jet 8^7 

, ,. Matching partons 5.8 

Distribution T . , r * 

Jet energy scale 5.4 

b-tagging efficiency 5.1 

Lepton ID/ reco/ trigger 3.5 

Luminosity 2.2 

tt cross section 12 

Normalization Nj e t s = 4 3.5 

N jets = 5 16 

Njets = 6 23 

Njets > 7 22 

Other backgrounds 50 



beled "Distribution" affect both shape and normalization of the Sj and Nj distributions, where 
only their effect on background normalization is quoted. Parameters labeled "Normalization" 
affect only the normalization of SM backgrounds and/ or new physics signal. 

The Sj distributions for different jet multiplicities are combined, and shown in Fig. [TJ after the 
maximum-likelihood fit to data. No excess over the predicted SM background is observed, and 
we proceed to set an upper limit on the QQ cross section. The upper limit is extracted using a 
frequentist CL S technique [39. 40| with an asymptotic approximation. The following likelihood 
ratio is used as a test statistic: 

( L(x\cr,v a )/L(x\cr,v) if cr > & 
II if a < cr, 

where L(x\o~, v) is the likelihood that x is observed in data, given a hypothesized value of the 
QQ cross section and "nuisance" parameters v. The values of a and v for which the likelihood 
reaches its maximum value are denoted a and v, respectively. The symbol v a refers to the values 
of the parameters v that maximize the conditional likelihood for any given value of cr. 

The probability to observe a value of t for the likelihood ratio that is larger than the observed 
value f bs is determined using pseudo-experiments in which the expected numbers of signal 
and background events are allowed to vary according to their statistical and systematic un- 
certainties. For pseudo-experiments generated assuming a background-only hypothesis, the 
probability is denoted by CLb. For pseudo-experiments assuming background plus signal with 
a cross section cr, the probability is denoted by CL s+ b (cr). The 95% confidence level (CL) upper 
limit for the QQ cross section is the value of a for which the ratio of CL s+ b (cr) and CLb, denoted 
as CL S , is 0.05. 

We also verify that the negative log likelihood is minimized at the value corresponding to the 
global minimum. We evaluate the likelihood as a function of the v parameters and check for 
secondary minima. No such minima are observed, and the data constrain the nuisance pa- 
rameters well within their a priori assumptions. Following the maximization of the likelihood, 
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the dominant uncertainty is from the matching between matrix element partons and parton 
showers. To quantify this effect, the likelihood minimization is performed excluding the par- 
ton matching systematic uncertainty. This results in reduction of the upper limit on the signal 
cross section for Mq = 600 GeV by 5%, and an increase in the exclusion limit by 15 GeV. This 
procedure provides an estimate of the impact of this uncertainty. 
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Figure 1: Distribution in Sj for different jet multiplicities after the maximum-likelihood fit to 
data. The last bin in each subfigure is the overflow bin.The bottom plot shows the ratios of 
data and SM plus signal over SM. Q — > t W and Q — > tZ distributions are shown for illustrative 
purposes for Mq = 500 GeV. 



6 Results 

Figure [2] shows the observed and expected 95% CL upper limit on the QQ production cross 
section, Cqq, for a down-type heavy quark decaying exclusively to tW. The lower mass limit is 
determined by the value at which the observed upper limit curve for c^q crosses the theoretical 
expectation. The observed (expected) limit corresponds to 675 (625) GeV. 

Figure[3|shows the observed and expected 95% CL upper limit on the as a function of quark 
mass (Q) for an up-type heavy quark decaying exclusively to tZ. The observed (expected) limit 
corresponds to 625 (550) GeV. 

Several cross checks have been performed to investigate the difference between the observed 
and expected limits. We studied several models of the tt Sj spectrum by using different genera- 
tors, such as PYTHIA and POWHEG. All of the generators provide results similar to MadGraph 
within their systematic uncertainties. We also studied different models of the tt St spectrum 
by changing internal parameters in MADGRAPH, such as the renormalization and factorization 
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7 Summary 



scales and the parameters responsible for matching jets originating from matrix element par- 
tons to their showers. We determine that a change of the matching parameters by one standard 
deviation from their nominal values provides good agreement between the simulated and ob- 
served spectrum of the Sj distribution, which can be accommodated in the fit because of the 
relatively weak dependence of the minimum on this parameter. The dependence of the Sj dis- 
tribution for tt background is covered by the systematic uncertainties included in the fit of the 
model to data. 

CMS is = 7 TeV L = 5.0 fb" 1 

a. 

|S 
o 

t 

a 
a. 

D 

lo- 1 



450 500 550 600 650 700 750 

M Q (GeV) 

Figure 2: The observed (solid line with points) and the expected (dashed line) 95% CL upper 
limits on the QQ production cross section as a function of the heavy quark mass, Mq, com- 
pared to the theoretical QQ cross section for the case of a down-type heavy quark decaying 
exclusively to tW. 



7 Summary 

A search for pair-produced new heavy quarks QQ decaying exclusively to tWfW or to tZtZ is 
performed in lepton + jets events. The analysis is based on a data sample of proton-proton 
collisions at ^/s = 7TeV corresponding to an integrated luminosity of 5.0 fb -1 . Events are 
selected requiring an electron or a muon, missing transverse momentum, and at least four jets, 
one of which is identified as a bottom jet. A combined fit is performed to the scalar sum of 
the transverse momenta of all final reconstructed objects as a function of jet multiplicity. No 
significant deviations from SM expectations are found, and upper limits on the production 
cross section of QQ as a function of a heavy quark mass are computed. Assuming a strong 
production mechanism for both signal models, down-type quarks decaying exclusively to tW 
with masses below 675 GeV and up-type quarks decaying exclusively to tZ with masses below 
625 GeV are excluded at 95% CL. These are the most stringent limits to date. 
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